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1  RESEARCH 

The  objectives  of  the  contract  were  to  develop  new  algorithms,  data  structures,  and  parallel  algo¬ 
rithm  design  methodologies  for  SIMD  and  MIMD  parallel  computers. 

During  the  course  of  the  contract  we  reviewed  the  many  performance  metrics  that  have 
been  proposed  for  parallel  systems  (i.e.,  program  —  architecture  combinations).  These  include 
the  many  variants  of  speedup,  efficiency,  and  isoefficiency.  We  have  found  reasons  why  none  of 
these  metrics  should  be  used  independent  of  the  run  time  of  the  parallel  system.  The  run  time 
remains  the  dominant  metric  and  the  remaining  metrics  are  important  only  to  the  extent  they 
favor  systems  with  better  run  time. 

We  also  laid  out  the  minimum  requirements  that  a  model  for  parallel  computers  should 
meet  before  it  can  be  considered  acceptable.  While  many  models  have  been  proposed,  none 
meets  all  of  these  requirements.  The  BSP  and  LogP  models  are  considered  and  the  importance  of 
the  specifics  of  the  interconnect  topology  in  developing  good  parallel  algorithms  pointed  out. 
Our  findings  will  be  written  up  shortly  and  submitted  for  journal  publication. 

Effective  scheduling  of  parallel  computers  is  important  to  obtain  good  performance.  We 
have  developed  a  model,  the  master-slave  system,  to  accurately  model  the  scheduling  problem 
that  arises  when  a  program  running  on  a  host  processor  initiates  many  tasks  on  an  attached  mul¬ 
tiprocessor.  The  master-slave  paradigm  also  finds  application  in  semiconductor  testing,  machine 
scheduling,  transportation,  maintenance  management  and  other  industrial  settings.  In  the  master- 
slave  model  a  set  of  jobs  is  to  be  processed  by  a  system  of  processors.  Each  job  consists  of  a 
preprocessing  task,  a  slave  task  and  a  postprocessing  task  that  must  be  executed  in  this  order. 
The  pre-  and  post-processing  tasks  are  to  be  processed  by  a  master  processor  while  the  slave  task 
is  processed  by  a  slave  processor.  We  are  presently  developing  algorithms  for  this  model. 

Our  work  on  algorithm  development  for  different  architectures  focussed  on  sorting,  selec¬ 
tion  and  packet  routing  on  the  AROB  (Array  with  Reconfigurable  Optical  Buses)  model  as  well 
as  on  the  OTIS  (optical  transpose  interconnection  system)-Mesh  and  OTIS-hypercube  models. 
This  work  included  the  study  of  algorithms  for  the  important  bit-permute-complement  (BPC) 
class  of  permutations  and  for  matrix  multiplication. 

Towards  the  end  of  the  contract  period  we  focussed  our  attention  on  the  OTIS-Mesh  model. 
This  parallel-computer  model  uses  a  combination  of  electronic  and  optical  interconnect.  Short 
connections  are  realized  using  an  electronic  mesh  topolgoy  and  long  connections  are  realized  us¬ 
ing  the  optical  transpose  topology.  Efficient  algorithms  to  perform  many  fundamental  tasks  such 
as  sort,  rank,  shift,  window  sum,  consecutive  sum,  adjacent  sum,  and  random  access  reads  and 
writes  were  developed  last  year.  Algorithms  for  these  fundamental  tasks  can  be  used  to  develop 
efficient  algorithms  for  high-level  applications  such  as  those  that  arise  in  image  processing. 

We  completed  work  on  algorithms  for  various  variations  of  the  matrix  multiplication  prob¬ 
lem  (this  work  was  begun  in  the  preceding  reporting  period).  These  variations  include  vector  x 
vector,  vector  x  matrix,  and  matrix  x  matrix.  Our  work  is  aimed  not  only  towards  matrix  multi¬ 
plication  when  the  problem  size  matches  the  computer  size  but  also  when  the  matrix  size  is  much 
larger  than  the  computer  size.  In  addition,  we  have  developed  efficient  algorithms  for  image  pro¬ 
cessing  applications  such  as  histogramming,  histogram  modification,  image  shrinking  and  ex¬ 
panding,  and  the  Hough  transform. 

Details  of  our  algorithms  can  be  found  in  the  publications  that  have  resulted  from  this  con¬ 
tract. 
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